Minimum Message Length Segmentation

نویسندگان

  • Jonathan J. Oliver
  • Rohan A. Baxter
  • Chris S. Wallace
چکیده

The segmentation problem arises in many applications in data mining, A.I. and statistics, including segmenting time series, decision tree algorithms and image processing. In this paper, we consider a range of criteria which may be applied to determine if some data should be segmented into two or regions. We develop a information theoretic criterion (MML) for the segmentation of univariate data with Gaussian errors. We perform simulations comparing segmentation methods (MML, AIC, MDL and BIC) and conclude that the MML criterion is the preferred criterion. We then apply the segmentation method to nancial time series data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Kindest Cut: Minimum Message Length Segmentation

We consider some particular instances of the segmentation problem. We derive minimum message length (MML) expressions for stating the region boundaries for some one and two dimensional examples. It is the found the message length cost of stating region boundaries is dependent on the noise of the data in the separated regions and also thèdegree of separation' of the two regions. The framework gi...

متن کامل

Minimum Message Length Grouping of Ordered Data

Explicit segmentation is the partitioning of data into homogeneous regions by specifying cut-points. W. D. Fisher (1958) gave an early example of explicit segmentation based on the minimisation of squared error. Fisher called this the grouping problem and came up with a polynomial time Dynamic Programming Algorithm (DPA). Oliver, Baxter and colleagues (1996,1997,1998) have applied the informati...

متن کامل

Information-Theoretic Image Reconstruction and Segmentation from Noisy Projections

The minimum message length (MML) principle for inductive inference has been successfully applied to image segmentation where the images are modelled by Markov random fields (MRF). We have extended this work to be capable of simultaneously reconstructing and segmenting images that have been observed only through noisy projections. The noise added to each projection depends on the classes of the ...

متن کامل

Speaker change detection using minimum message length criterion

Speaker change detection or speaker-based segmentation is useful and important in many applications, such as transcribing broadcast news or telephone conversations. It usually serves as a preliminary step prior to speech/speaker recognition. Among various methods proposed in the literature, Bayesian Information Criterion (BIC) based method has been widely used. In this paper, we propose to use ...

متن کامل

Change-Point Estimation Using New Minimum Message Length Approximations

This paper investigates the coding of change-points in the information-theoretic Minimum Message Length (MML) framework. Changepoint coding regions affect model selection and parameter estimation in problems such as time series segmentation and decision trees. The Minimum Message Length (MML) and Minimum Description Length (MDL78) approaches to change-point problems have been shown to perform w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998